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Abstract 

The set of all m-tuples of compatible full conditional distributions on discrete ran- 
dom variables is an algebraic set whose defining ideal is a unimodular toric ideal. 
We identify the defining polynomials of these ideals with closed walks on a bi- 
partite graph. Our algebraic characterization provides a natural generalization of 
the requirement that compatible conditionals have identical odds ratios and holds 
regardless of the patterns of zeros in the conditional arrays. 
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1 Introduction 



Statisticians have long been interested in combining marginal and conditional 
distributions in order to compl etely specify a joint distribution. Such mod- 



els appear in spatia l statistics (IBesagl 119741 ) . analysis o f contingency t ables 



1 r V i "i I — ■ — i 

(Bis hop et all 119751). Bavesian pr ior elicitation contexts f)Q'Haganlll998f). ex- 



1999), statistical disclosure limitation (|Slavkovic 



pert systems (|Cowell et al 
2004), and generally in any area of applied statistics where one wishes to 



build global statistical models from local information about sub collections of 
random variables. 



Two fundamental theoretical questions which have been addressed in the lit- 
erature are the compatibility of conditionals and marginals, and the unique- 
ness of the joint distribution when it exists. A collection of conditionals and 
marginals are compatible if there exists a joint distribution with these condi- 
tionals and margin als. The first major result along these lines is the Hammersley- 
Clifford theorem (|Besad . I19T4T ) which establishes a connection between the 
positive joint probability distribution and the full conditionals: it provides 
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sufficient conditions for compatibility of distributions in the setting of Markov- 
random fields. In the discrete case, the Hammersley-Clifford theorem applies 
when there are no cells with zero probability. Questions about the uniqueness 
of the joint distribution given a compatible c o llectio n of marginals and condi- 
tionals were addressed in lGelman and S peed (1993) and the results presented 



Arnold et all ljl99flj) 



there were subsequently generalized in I Arnold et al. (1999). 
also address the question of whether the given set of densities are compatible; 
they describe a variety of compatibility conditions for the case of finitely many 
discrete random variables and give algorithms for checking the compatibility 
of a set of conditional probability densities. 



Let Xi, . . . ,X n be discrete random variables where the set of possible states 
for each Xi is in the set of integers [dj\ := {1, 2, ... , di}. The joint probability 
distribution p(x) = p(Xi = i%, . . . ,X n = i n ) can be represented by a d\ x 
■ • • x d n array of nonnegative real numbers that sum to one. We define a 
full conditional as a conditional multi-dimensional array p(xa\%b) where A U 
B = [n] and A n B = so that the conditional array depends on all the 
random variables. The precise statement of the compatibility problem for full 
conditionals is the following: 

Problem 1 Given partitions (Ai, B\), . . . , (A m , B m ) of[n] and arrays C l . . . C m 
each of format di X • • • X d n , when is there a joint probability distribution p(x) 
such that p(xAi\xBi) = C % for all i? 

The compatibility problem makes sense for any set of conditionals and marginals, 
but we shall see a particularly simple combinatorial solution when we restrict 
attention to compatibility among full conditionals. 



Given fixed values of the conditional and marginal arrays, the problem of de- 
termining the existence of a joint probability distribution can be phrased as a 
feasibility question for linear programs. Since marginal and conditional arrays 
are defined by linear and linear-fractional constraints in terms of the joint 
probabilities, the existence of a joint given the conditionals c an be decided 



by de termining whether or not a certain polytope is nonempty (j Arnold et al. 



1999). While this solution has many attractive features (the most important 



of which is the ability to include extra param eters to account for data that 



are "almost compatible" (| Arnold et al 



1999)), it does not give an intrinsic 



characterization of those sets of arrays which correspond to compatible con- 
ditional distributions. In particular, using linear programming to determine 
whether or not a set of conditionals are compatible requires the introduction 
of extra parameters. 



In this paper, we develop intrinsic methods for determining compatibility of 
conditional matrices, in the presence of zero cell entries. We describe explicit 
algebraic restrictions on the arrays C 1 , . . . , C m which guarantee that the pre- 
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scribed full conditional distributions are compatible. The results can be inter- 
preted statistically in terms of conditions that replace generalized odds ratios 
to allow for arbitrary patterns of zeros. In this sense, our results provide a 
key link between the statistical notions of odds ratios and tools from algebraic 
geometry. 

Our main result is the following theorem. 

Theorem 2 An m-tuple of d\ x • • • x d n arrays C 1 , . . . , C m are compatible 
full conditionals for the partitions (A 1: Bi), . . . , (A m , B m ) if and only if they 
satisfy the following four conditions: 

(*) ( "r,-j- - f° r al1 i and 3^32, ■ ■ ■ ,3n- 

(2) IfC}° v .. jn = for some i 0; then C)^ = for all i. 

(3) For each i the Bi margin of C % is an array of all ones. 

(4) The entries of the arrays C l satisfy polynomial conditions which are in 
bisection with the induced circuits of a bipartite graph that depends on the 
d 1 ,d 2 , ...,d n and (A u Bi), (A m ,B m ). 



The outline for the rest of the paper is as follows. In the next section, we 
provide a detailed description of our results in the case of two random vari- 
ables. We give precise defining relations for the space of compatible condition- 
als which generalize the condition that odds ratios of consistent conditionals 
should be the same. In particular, we give necessary and sufficient polyno- 
mial conditions for two conditionals to be compatible. In the third section, 
we describe the theoretical tools from com binatorial comm utative algebra 
( Sturmfelsl . Il995h and discrete optimization (|Schriiverl . Il998| ) which we will 
need to prove Theorem 2. In the fourth section we provide a general proof 
that the space of compatible full conditionals is a unimodular toric variety. 
The fifth section is devoted to a few trivariate examples to illustrate the main 
theorem. 



2 Two random variables 



In this section, we give a detailed explanation of our algebraic results in the 
case of the bivariate compatible conditional problem. We refer the reader to 
( Cox et al. . 1996i l for the basics of algebraic geometry. We relegate technical 
definitions and proofs of the main theorems to the third and fourth sections. 
We denote the two potential conditional arrays by 



C ij =p{X 1 = i\X 2 =j) 



p{X 1 =i,X 2 =j) 
p(X 2 =j) 



and 
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= = j I* = = p(Xi - • 

This yields the following parametrization of the space of compatible condi- 
tionals. 

Proposition 3 Two nonnegative arrays C and D represent compatible con- 
ditionals for two discrete random variables if and only if there are nonnegative 
parameter arrays Py, Ui, Vj such that 

J2i Cij = 1 for all j and J2j ^ij = 1 f or a ^ 

Statistically, the array Py represent a joint distribution which has C and D 
as its corresponding conditionals. Arrays U{ and Vj are marginals which when 
combined with appropriate conditionals give Py. This representation says that 
the set of all arrays C and D which are compatible conditions is determined by 
allowing the arrays P, u and v to range over all nonnegative values which make 
C and D into conditional arrays. The given parametrization is by polynomial 
functions in the parameters, and hence there exist polynomials in the Cy and 
Dij which vanish if and only if the pair C and D belong to the (closure of the) 
space of compatible conditionals. In the literature of discrete optimization 
and toric ideals, the parametrization defined in Proposition 3 is known as 
the Lawrence lifting of the Segre variety. The equations which vanish on this 
parametrization are well understood. 



Theorem 4 (e.g. Sturm fell. 199 A. Ch. 14) The defining equations for the 



Lawrence lifting of the Segre variety are in bisection with circuits in the com- 
plete bipartite graph K dl>d2 . Each circuit of length 2r defines a binomial of 
degree 2r in the and D^. 

This bijection is described as follows. Let 12,32, ■ ■ ■ , K,3r, h) be a circuit 

in K dl>d2 of length 2r. This circuit produces a binomial of degree 2r: 

nun n d e n c 

Note that the degree four relations which are produced by this construction 
are precisely the relations which say that all the odds ratios of the matrices C 
and D must be the same. For example, the circuit of length 4 in K 2 ^ produces 
the binomial: 

C11C22-D12-D2I — C\2C2lDnD22- 

For positive C and D, this binomial is zero if and only if the following odds 
ratios are equivalent: 

C11C22 _ D11D22 
C12C21 -D12-D21 
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It is well known that for positive matrices C and D, compati bility of con 



dition al matrices is equivalent to equality of the odds ratios ([Arnold et al 



1999). In algebro-geometric terms, this says that the degree four polynomi- 
als produced by the circuit construction define the toric variety of interest 
set theoretically in the strictly positive orthant. However, this statement be- 
comes false when some of the entries in C and D are allowed to be zero. 
These polynomials, in addition to addressing the compatiblity of arrays C 
and D in the presence of zero entries, also should be useful for determing the 
simil arity of linear contr asts in the C and D matrices when there are zero 
cells ( Bishop et al. . 1975| ). Thus, these polynomials could prove useful for the 
analysis of incomplete contingency tables. 



Example 5 Consider the matrices 



C 



and D 



(l 2 

3 3 





\ 2 2 J 



1 2 
3 3 



1 



\3 U 3/ 



All of the odds ratios of these matrices are the same: they are either simul- 
taneously zero or undefined. In other words, they saitisfy all 9 polynomials 
which are described by circuits in K 33 of length 4- However, the matrices do 
not represent compatible conditionals since the binomial given by the length 6 
circuit (1, 1, 2, 2, 3, 3, 1): 

C11C22C33-D12-D23-D31 — C12C23C31-D11-D22-D33 

is equal to 1/108 a nd not zero. These incompatible conditionals appear in 
Arnold et all \l999\) . 



These polynomials give a parameter free method for determining if two arbi- 
trary conditional arrays are compatible. 

To make the connection to the next section more lucid, we will introduce 
an alternate graph theoretic representation of the circuit binomials described 
above. This more complicated description is the one that generalizes for arbi- 
trary sets of full conditionals in the next section. 

Definition 6 We define the bipartite graph Gd ly d 2 ^° ^ e ^ e Q^ph with d\d2 + 
d\ + g?2 vertices as follows. 

(1) There are d\d^ vertices labeled where i £ [d\] and j £ [d 2 ] ■ 

(2) There are d\ vertices labeled Wk where k £ [di\. 

(3) There are d 2 vertices labeled m where I £ [rf 2 ]. 

(4) There is an edge between Vij and Wk if and only if i = k. 

(5) There is an edge between and u\ if and only if j = I. 
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11 12 13 21 22 23 




12 12 3 

Fig. 1. The bipartite graph C?2,3 



(6) There are no other edges in Gd 1> d 2 - 

The graph G 2 ,3 is pictured in Figure 1. Associated to any circuit in Gd 1> d 2 °f 
length 2r we get a binomial of degree r as follows. After permuting indices, 
circuits in Gd lt d 2 have the form 

(kji,H, H32, 32, i-232, ■ ■ ■ ,jr, iv3v) 
where %\ = i r and j\ = j r . From this circuit we recover the binomial 
c n c . . ■ n — n c n . . . c 

^ ^1231^ '1232 ^lljr - Ly tl3l*~ , t23l ±y l232 ^HJr, 

which was the same binomial from our previous description. Putting all these 
ideas together we have the following special case of Theorem 2. 

Corollary 7 A pair of matrices C and D are compatible conditionals if and 
only if they satisfy the following properties: 

(1) C > and D > 0, 

(2) for all i and j , Cij = if only if Dij = 0, 

(3) J2i Cij = 1 for all j and J2j = 1 for all %, and 

(4) C and D satisfy the circuit polynomials associated to the circuits in the 
graph G dlida . 



3 Unimodular toric varieties 

In this section, we will describe the mathematical objects that we will need to 
describe the space of compatible conditionals. First, we show that the space 
of compatible conditionals is a parametrized algebraic set. The parametriza- 
tion is given by monomials and this leads naturally to the definition of toric 
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varieties and toric ideals. Then we introduce the notions of graphical unimod- 
ular matrices and unimodular toric varieties, which are the natural algebraic 
objects for representing the space of compatible conditionals. 

A key preliminary observation is the following: 

Theorem 8 A set of full conditional matrices C 1 , . . . , C m are compatible con- 
ditionals corresponding to the partitions (A ± , E>i), . . . , (A m , B m ) if and only if 
there are matrices of probabilities U 1 , . . . , U m of the appropriate sizes and a 
d± x • • • x d n matrix of probabilities P such that 

P ■ = C i ■ iP 

31,—,3n jl, — ,jn jk 1 ,--;jk 3 

for all i where {k\ < . . . < k s } = B>i. 

PROOF. This is the definition of compatibility together with Bayes' rule. 
The matrices U l are then the "missing" ^-marginal distributions. □ 

Since we are interested in intrinsic characterizations of the compatible condi- 
tionals, it makes sense to rearrange this expression to deduce an equation for 
the C l in terms of the missing joint and conditional distributions P and U l . 
Thus we deduce the following corollary. 

Corollary 9 The set of all m-tuples of consistent full conditional arrays C 1 , 
. . ., C m is described parameterically as 

C l — P TP 
where the multiplication is componentwise. 

At first glance, the space of conditionals we have described in Corollary 9 
looks highly over-parameterized: shouldn't the U parameters be related to the 
P parameters by marginalization? 

We claim that we can first allow the parameters P and U l to be arbitrary 
real arrays, consider the variety which is defined by this parameterization, 
and then intersect this variety with the appropriate product of probability 
simplices. The result of this procedure will be the same set of conditional arrays 
which is obtained by first restricting the parameters so that LP = p(xb) and 
then considering the resulting parametrically described variety. To see this, 
observe that the only way that C l could be a conditional array is if U l is the 
corresponding marginal p(xb), which again follows by Bayes' rule. 

Let p(xa 1 \xb 1 ) an d p{xa 2 \xb 2 ) be two full conditional matrices. Note that 
if B 2 C B>i, then they are compatible if and only if p{xa 1 \xb 1 ) is obtained 
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from p(xa 2 \xb 2 ) by conditioning on the variables A 2 \ A%. Hence when de- 
termining the space of consistent conditionals for p(xa 1 \xb 1 ), p( x a 2 \%b 2 ), ■ ■ •> 
p{x A m \x B m ) i we may assume that there are no containment relations among 
the A4 or Bi. That is, the collection A = {Bi, . . . , B m } can be taken to be the 
facets of a simplicial complex. 

We will work in the polynomial ring 

R = R[C] lr „ )in |l < i < m and 1 < j a < d a , for alls] 

which is a polynomial ring in D = mdydi • • ■ d n indeterminates. Each inde- 
terminate in the ring R corresponds to an entry in one of the m conditional 
matrices. By Corollary 9, the algebraic variety in MP, which, when intersected 
with a product of probability simplices, yields the space of consistent condi- 
tional, is given by the monomial parameterization 

f™ = P ■ TP 

^JU-jn n,-,jn u j kl ,...,j ks 

where {k\ < k 2 ■ ■ ■ < k s } = Bi and the P's and £Ts are allowed to range over 
all of the real numbers. 

Definition 10 An algebraic variety which is given by a monomial parameter- 
ization is called a toric variety. 

To describe the polynomials that define any toric variety amounts to under- 
standing which products of the parameterizing monomials are equal to which 
other products of the parameterizing monomials. That is, the toric ideal which 
defines the toric variety is generated by binomials (differences of monomials). 

Definition 11 The toric ideal which is the vanishing ideal of the space of 
compatible conditionals is denoted I a which we call the compatibility ideal. 

Furthermore, understanding which monomials are equal to each other can 
be studied by understanding the integer kernel of an associated matrix. A 
standard reference for the theory of toric ideals is 

Definition 12 Let A G Z dxn be an integer matrix. The toric variety associ- 
ated to A is defined by the parameterization 

™ fiii,a 2 i j.ddi 

Xj — ii t 2 ■ ■ ■ L d • 

The toric ideal I a which defines this parameterized variety is generated by the 
(infinitely many) binomials 

I A = ( x u -x v \Au = Av) . 
The presentation of a toric ideal I a in Definition 12 is given with infinitely 



Stunnfel4 ll995). 
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many ideal generators. However, the Hilbert basis theorem implies that only 
finitely many of these polynomials are actually needed in a generating set of 
the ideal. In many situations, the difficulty of studying toric ideals amounts to 
finding good combinatorial descriptions of this finite set of generators. In the 
case of the compatibility ideals Ja there is a simple combinatorial structure 
called unimodularity which can be used to describe the generators. 

Definition 13 A toric ideal I a is called unimodular if every reduced Grobner 
basis of I a consists of squarefree binomials (i.e. there are no squared variables 
in the leading terms of each binomial in the reduced Grobner basis). Equiva- 
lent^, I a is unimodular if all the non-zero minors of A have the same absolute 
value. Such a matrix A is also called unimodular. 



The foll owing is a n impo rtant special family of unimodular matrices (see, for 



example ISchriiverl (199 



Proposition 14 Let A be a matrix with the following properties: 

(1) All the entries of A are either zero or one, 

(2) there are precisely two l's in each column of A, and 

(3) there is a partition of the rows of A into two sets U and V such that each 
column of A has exactly one nonzero entry with row index u EU and one 
nonzero entry with row index v G V . 

Then A is a unimodular matrix and is called a graphical unimodular matrix. 

The reason for the name "graphical" is due to the fact that any graphical 
unimodular matrix A is the vertex-edge incidence matrix of a bipartite graph 
Ga- The parition of the vertices of Ga corresponds to the partition of the rows 
of A. Notice that vectors v G ker(^4) correspond to the union of cycles in the 
graph G A - 



Definition 15 Let G be any graph. An induced circuit of G is a circuit of G 
which does not have a chord (i.e. a "short cut") in G. 



In the complete bipartite graph K 3 3 all of the cycles of length 4 are induced, 
whereas none of the six cycles are induced because there is a chord cutting 
across them. Toric ideals which are presented by a graphical unimodular ma- 
trix have a simple combinatorial description for their minimal generators in 
terms of the induced circuits of the associated bipartite graph . A version of 
the following result can be found in ( Aoki and Takemura . 2002j ). 



Proposition 16 Let A be a graphical unimodular matrix, and Ga the as- 
sociated bipartite graph. Let the U and V be the partition of the vertices of 
Ga where U = {ui, . . . , ui t } and V = {vi, . . . , vi 2 }. For each circuit c = 



9 



(u^, Ui 2 , Vj 2 , . . . , Ui r ,Vj r , UiJ we associate the circuit binomial 

fc = x UiiVj ^x Ui ^ Vj2 ■ ■ ■ x UirVjr — x Ui ^ Vji x UisVj2 ■ ■ -x UiiVjr . 

Then I a is minimally generated by the circuit binomials corresponding to the 
induced circuits oJGa- That is, 

I A = (fc\ c is an induced circuit of G a) ■ 



4 The main algebraic result 

Now that we have reviewed all the mathematical facts we need about unimod- 
ular toric ideals, we are in a position to prove that the compatibility ideal is 
a unimodular toric ideal. To do this, we construct the matrix that represents 
this toric ideal. 

Let d — (d\, . . . , d n ) be the integer vector corresponding to the dimensions of 
the tables let A = {B±, . . . , B m } be the simplicial complex which represents 
the compatible conditionals. For each facet Bi of A, denote by d B . the vector 

dBi = (rffci, • • • , d ks ) 
where Bi = {k±, . . . , k s } and denote by D Bi the product 

D Bi = 114, 

kiEFi 

We denote by A^d the matrix that represents the compatibility ideal corre- 
sponding to this data. This matrix is a 

m 

(did 2 ---dn + Y^ d Bl ) x md\d 2 • ■ ■ d n 
i=i 

matrix. The rows are naturally grouped into m + 1 blocks and the columns are 
natrually grouped into m blocks. Each column is labelled by an integer i G [m] 
and an n-tuple (ji, . . . , j n ) G [di] x [d 2 \ x • • • x [d n j. The first block of rows is 
labelled by the integer and n-tuple . . . , j n ) G [di] x [d 2 \ x • • • x [d n ]. Each 
of the remaining blocks of rows is labelled by and integer % and an \Bi\ -tuple 
(jki, ■ ■ -Jk s ) e [dkj] x • • • x [d ks ] where B t = {h, k s }. 

The entries of this matrix are all ones and zeros. There is a one in a particular 
entry with row indexed by the data i 1 , . . . and column indexed by the 
data i 2 , (jf, . . . if and only if it satisfies the following rules: 

(1) if the row label is and ...,j}) = (jf, . . or 
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(2) if the row label is i 1 > 0, i 1 = i 2 , and (j^, . . . ,j\ s ) = (j^, . . . , jfj where 
Bii = (fa, k s ). 

Theorem 17 The matrix A^ d is a graphical unimodular matrix and it rep- 
resents the toric variety of the space of compatible conditionals. Hence, the 
compatibility ideal is generated by the induced circuit binomials in the associ- 
ated bipartite graph. 



PROOF. To see that A^d represents the toric variety of compatible con- 
ditionals amounts to identifying the rows of the matrix with a parameter, 
and the columns of the matrix with an entry in the conditional matrix. The 
labelling for the columns i, (j±, . . . ,j n ) naturally corresponds to the indetermi- 
nate Cj jn . A row in the first block, with i — 0, corresponds to the Pj 1: ...,j n 
parameters. A row in any of the blocks with % > corresponds to the param- 
eter U] a . Note that be the description of the matrix A\ a, in the column 
corresponding to - n there are precisely two nonzero entries: one in the 

first block which corresponds to Pj u ...,j n and one in block i, which corresponds 
to Uj k 5 j k . Thus, this represents the parameterization 



pi p jji 

^jl-jn 1 31-3n U j kl -j ka 



as desired. 



Now we wish to show the unimodularity of A^ d , however, this is an immediate 
consequence of the preceding argument. Each column has precisely two ones: 
one with index % — and one with index % > 0. Hence, the value of % determines 
the partition of the rows to deduce the structure of the underlying bipartite 
graph. □ 



Now we will explicitly describe the graph encoded by the matrix A^ d . 
Definition 18 The graph G^d associated to the compatibility ideal has 

m 

(did 2 •••d n + J2 d F>) 
i=i 

vertices and md\di ■ ■ • d n edges: these are the number of rows and columns 
°f A^d respectively. The vertices are partitioned into two class: those labelled 
with i — and some (Ji, ■ ■ ■ ,j n ), an d those labelled with i > and some 
(jfci, • • • , jk s )- Vertices with i = are incident only to vertices with i > and 
conversely. In particular, the vertex i = 0, (j\, . . . is incident to i > 0, 

Uli » • • • Ji) if and onl v l f (ife! ■■■Jk.) = (ill > • • • Jk. ) ■ 

To close this section, we now provide the proof of Theorem 2. 
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PROOF of Theorem 2: Conditions (l)-(3) of Theorem 2 are clearly nec- 
essary. Condition (4) is simply the expression of Theorem 17 above. What 
remains to show is that this conditions are, in fact, sufficient. Suppose that a 
particular realization of conditional arrays C 1 , . . . , C m satisfy conditions (1) - 
(4). We wish to show that there exists parameter matrices P and [/,..., U m , 
which have C 1 C m as their image. By conditions (1), (3), and (4) to- 
gether with a result of iGeiger. et al. this can happen if and only if 
the columns of A^ which are indexed by the support of C 1 , . . . , C m form 
a nice facial subset of the columns of A^. The nice facial subsets of the 
columns of A^.d are precisely those sets of columns which are obtained by 
taking a collection of rows R of Aa,<i, and taking exactly those columns of 
Aa^ which have zeros in the rows R. However, the support of C 1 , . . . , C m is a 
nice facial subset of the columns of A^d because of condition (2): the rows of 
Aa,cI to choose are precisely those rows labelled by (0, (ji, . . . ,j n )) for all the 
indices . . . ,j n ) where the _ - n are collectively zero. □ 



5 Trivariate examples 



We will conclude our paper with some examples on three binary random vari- 
ables to illustrate the construction and the types of polynomials that appear. 

Example 19 Consider the compatibility ideal associated with three binary 
random variables and suppose that A = {{1}, {2}, {3}}. In other words, we 
are considering the compatibility of p(x\, X2\xs),p(Xi, a^la^) and p(x2, x%\xi). 
The matrix, Aa, (2,2,2) has 14 rows and 24 columns. It is the zero /one matrix 
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The graph corresponding to this compatibility problem has 14 vertices: 8 of one 
type and 6 of the other, and 24 edges between them. It is pictured in Figure 2. 



There are three types of induced circuits in this graph, yielding three types of 
binomial generators of the compatibility ideal I&. We use the indeterminates 
Cijk, Dij k and to denote the corresponding entries in the conditional ar- 
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Ill 112 121 122 211 212 221 222 




cl c2 dl d2 el e2 

Fig. 2. The compatibility graph for p(x±, X2\xs),p(xi, xs\x2),p(x2, xs\xi) 

rays. For instance, the circuit (111, ci, 211, d±, 111) in the graph is induced and 
from it we deduce that the quadratic binomial 

C111-D211 — -D111C211 

is a minimal generator of the compatibility ideal I a ■ The induced circuit 
(111, ci, 221, e2, 212, di, 111) produces the binomial 

Clll -£'221-0212 — C221-E-212-Dlll 

which is also a minimal generator of I a- toto/, i/iere are 12 induced circuits 
of length 4, 8 induced circuits of length 6, and 60 induced circuits of length 8. 
Hence I a is minimally generated by 12 quadric, 8 cubics, and 60 quartics. 

Example 20 Consider the compatibility associated to 3 binary random vari- 
ables with the simplicial complex A = {{1, 2}, {1, 3}, {2, 3}}. That is, we are 
considering the compatibility ofp(x\y,z), p(y\x,z), and p(z\x,y) . Use the in- 
determinates C^, D^, and to represent entries in the corresponding 
conditional arrays. The ideal I a is generated by 28 binomials which fall into 4 
equivalence classes modulo the symmetry of the cube. In this case, the binomial 
minimal generators of I a are in bisection with the circuit in the edge graph of 
the cube. The four different types of circuit on the cube are indicated in figure 
3. 




Fig. 3. The combinatorial types of circuits on the cube 
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These four types of circuits yield four different symmetry classes of binomials 
in the ideal which are represented by the following binomials: 

C111C221-D121-D211 — C121C211-D111-D221 

Clll-D211-E'221-D222C212-Ell2 — C2II-D22I-E222-D212C112-E1II ^ 
Clll-D211-E l 22lC222-Dl22-E'll2 — C2II-D22I-E222C122-D112-E1II , 
^111^211^221^12lQL22^222^212-E'll2 — ^211^221^121^122^222^212Ql.l2-E'lll • 

./Vote i/ierf £/ie /irst polynomial in this list has a natural statistical interpretation: 
it corresponds to the equality of the odds ratios for p(x\y, z) and p(y\x,z). 

While there are statistical interpretations for some of the circuit polynomials 
which arise, understanding these polynomial expressions in relation to known 
statistical quantities remains an open problem. In particular, determining how 
tools from the analysis of contingency tables such as generalized odds ratios 
and linear contrast relate to the polynomial constraints we have derived could 
prove useful for inference on incomplete contingency tables. 
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